Picture for Pengwei Wang

Pengwei Wang

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models

Add code
Feb 01, 2026
Viaarxiv icon

RoboBrain 2.5: Depth in Sight, Time in Mind

Add code
Jan 20, 2026
Viaarxiv icon

Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation

Add code
Jan 04, 2026
Viaarxiv icon

RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion

Add code
Dec 30, 2025
Viaarxiv icon

Do You Have Freestyle? Expressive Humanoid Locomotion via Audio Control

Add code
Dec 29, 2025
Viaarxiv icon

Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation

Add code
Dec 29, 2025
Viaarxiv icon

RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics

Add code
Dec 15, 2025
Viaarxiv icon

PIGEON: VLM-Driven Object Navigation via Points of Interest Selection

Add code
Nov 17, 2025
Figure 1 for PIGEON: VLM-Driven Object Navigation via Points of Interest Selection
Figure 2 for PIGEON: VLM-Driven Object Navigation via Points of Interest Selection
Figure 3 for PIGEON: VLM-Driven Object Navigation via Points of Interest Selection
Figure 4 for PIGEON: VLM-Driven Object Navigation via Points of Interest Selection
Viaarxiv icon

GridPrune: From "Where to Look" to "What to Select" in Visual Token Pruning for MLLMs

Add code
Nov 13, 2025
Viaarxiv icon

RoboOS-NeXT: A Unified Memory-based Framework for Lifelong, Scalable, and Robust Multi-Robot Collaboration

Add code
Oct 30, 2025
Viaarxiv icon